Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 215094 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 93.2 MiB |
| Average record size in memory | 454.6 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 4 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-03-09 06:53:46.438910 |
|---|---|
| Analysis finished | 2020-03-09 07:04:00.507705 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| 회원번호 has a high cardinality: 215094 distinct values | High cardinality |
| 회원이름 has a high cardinality: 150751 distinct values | High cardinality |
| 담당자 has a high cardinality: 2157 distinct values | High cardinality |
| 진행률 is highly correlated with 총불입액 | High correlation |
| 총불입액 is highly correlated with 진행률 | High correlation |
| 해약금액 has 176137 (81.9%) zeros | Zeros |
| 연체횟수 has 3066 (1.4%) zeros | Zeros |
| 상태 has 3665 (1.7%) zeros | Zeros |
df_index
Real number (ℝ≥0)
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120606.92464224943 |
|---|---|
| Minimum | 0 |
| Maximum | 234612 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12196.65 |
| Q1 | 64479.25 |
| median | 121871.5 |
| Q3 | 178243.75 |
| 95-th percentile | 222708.35 |
| Maximum | 234612 |
| Range | 234612 |
| Interquartile range (IQR) | 113764.5 |
Descriptive statistics
| Standard deviation | 66721.49409 |
|---|---|
| Coefficient of variation (CV) | 0.553214455 |
| Kurtosis | -1.159724153 |
| Mean | 120606.9246 |
| Median Absolute Deviation (MAD) | 57552.93947 |
| Skewness | -0.06307497504 |
| Sum | 2.594182585e+10 |
| Variance | 4451757774 |
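The derived figures in the tables above follow from the reported mean, standard deviation, quartiles, and extremes. The following is a minimal Python sketch, using only values printed in this report (the variable names are illustrative, not part of the report), that recomputes them as a sanity check:

```python
# Values copied from the statistics tables for this variable.
mean = 120606.9246
std = 66721.49409
q1, q3 = 64479.25, 178243.75
minimum, maximum = 0, 234612

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance is the squared standard deviation

print(f"CV={cv:.6f}  IQR={iqr}  range={value_range}  variance={variance:.4e}")
```

The recomputed values should agree with the reported CV, IQR, range, and variance up to the rounding of the inputs.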
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 11343.5 19639.5 36392.5 53461.5 ... 226652.5 228076.5 228248.5 228347.5 234612. ], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2047 | 1 | < 0.1% | |
| 213309 | 1 | < 0.1% | |
| 4439 | 1 | < 0.1% | |
| 6486 | 1 | < 0.1% | |
| 341 | 1 | < 0.1% | |
| 2388 | 1 | < 0.1% | |
| 14674 | 1 | < 0.1% | |
| 8529 | 1 | < 0.1% | |
| 10576 | 1 | < 0.1% | |
| 53583 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 234612 | 1 | < 0.1% | |
| 234611 | 1 | < 0.1% | |
| 234610 | 1 | < 0.1% | |
| 234609 | 1 | < 0.1% | |
| 234607 | 1 | < 0.1% |
회원번호
Categorical
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 218A041035 | 1 | < 0.1% | |
| 218A023141 | 1 | < 0.1% | |
| 214A000549 | 1 | < 0.1% | |
| 218A019777 | 1 | < 0.1% | |
| 217A004748 | 1 | < 0.1% | |
| 1022A11315 | 1 | < 0.1% | |
| 1022A00267 | 1 | < 0.1% | |
| 211A003889 | 1 | < 0.1% | |
| 1011A11517 | 1 | < 0.1% | |
| 216A006562 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
Length
| Max length | 11 |
|---|---|
| Mean length | 10.00073456 |
| Min length | 10 |
Unicode categories
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Uppercase_Letter | 16 | 57.1% | |
| Decimal_Number | 10 | 35.7% | |
| Connector_Punctuation | 1 | 3.6% | |
| Dash_Punctuation | 1 | 3.6% |
Unicode scripts
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Latin | 16 | 57.1% | |
| Common | 12 | 42.9% |
Unicode blocks
| Value | Count | Frequency (%) | |
|---|---|---|---|
| ASCII | 28 | 100.0% |
회원이름
Categorical
| Distinct count | 150751 |
|---|---|
| Unique (%) | 70.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 주식회사에프피에이110111 | 70 | < 0.1% | |
| 임덕길470529 | 24 | < 0.1% | |
| 대성건설(주)124-81 | 20 | < 0.1% | |
| 강성복551227 | 12 | < 0.1% | |
| 백승식720219 | 12 | < 0.1% | |
| (주)지케이씨교역540605 | 10 | < 0.1% | |
| 김숙경550210 | 9 | < 0.1% | |
| 안규덕631212 | 8 | < 0.1% | |
| 김현순650213 | 8 | < 0.1% | |
| 조상영551213 | 8 | < 0.1% | |
| Other values (150741) | 214913 | 99.9% |
Length
| Max length | 25 |
|---|---|
| Mean length | 9.006541326 |
| Min length | 8 |
Unicode categories
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Other_Letter | 502 | 90.6% | |
| Uppercase_Letter | 26 | 4.7% | |
| Decimal_Number | 10 | 1.8% | |
| Lowercase_Letter | 9 | 1.6% | |
| Other_Punctuation | 3 | 0.5% | |
| Close_Punctuation | 1 | 0.2% | |
| Dash_Punctuation | 1 | 0.2% | |
| Open_Punctuation | 1 | 0.2% | |
| Space_Separator | 1 | 0.2% |
Unicode scripts
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 502 | 90.6% | |
| Latin | 35 | 6.3% | |
| Common | 17 | 3.1% |
Unicode blocks
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 502 | 90.6% | |
| ASCII | 52 | 9.4% |
주소
Categorical
| Distinct count | 46 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 경기 | 59425 | 27.6% | |
| 서울 | 54498 | 25.3% | |
| 인천 | 16745 | 7.8% | |
| 경상 | 11440 | 5.3% | |
| 광주 | 9080 | 4.2% | |
| 부산 | 8535 | 4.0% | |
| 전라 | 8133 | 3.8% | |
| 강원 | 7861 | 3.7% | |
| 충청 | 7778 | 3.6% | |
| 대전 | 6562 | 3.1% | |
| Other values (36) | 25037 | 11.6% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.999958158 |
| Min length | 1 |
Unicode categories
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Other_Letter | 46 | 93.9% | |
| Other_Punctuation | 1 | 2.0% | |
| Decimal_Number | 1 | 2.0% | |
| Space_Separator | 1 | 2.0% |
Unicode scripts
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 46 | 93.9% | |
| Common | 3 | 6.1% |
Unicode blocks
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 46 | 93.9% | |
| ASCII | 3 | 6.1% |
상품금액
Real number (ℝ≥0)
| Distinct count | 24 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.33326345141433056 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 40 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1571254568 |
| Q1 | 0.2570036541 |
| median | 0.3909866017 |
| Q3 | 0.3909866017 |
| 95-th percentile | 0.4640682095 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1339829476 |
Descriptive statistics
| Standard deviation | 0.08800895179 |
|---|---|
| Coefficient of variation (CV) | 0.2640822191 |
| Kurtosis | -0.6663673791 |
| Mean | 0.3332634514 |
| Median Absolute Deviation (MAD) | 0.07639967687 |
| Skewness | -0.6135871896 |
| Sum | 71682.96882 |
| Variance | 0.007745575595 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00609013 0.07369062 0.14616322 0.15834348 ... 0.4954933 0.51961023 0.54397077 0.78465286 1. ], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.3909866017 | 92906 | 43.2% | |
| 0.3544457978 | 35183 | 16.4% | |
| 0.2082825822 | 31927 | 14.8% | |
| 0.2570036541 | 21790 | 10.1% | |
| 0.4640682095 | 15757 | 7.3% | |
| 0.1571254568 | 13124 | 6.1% | |
| 0.28136419 | 2357 | 1.1% | |
| 0.3179049939 | 848 | 0.4% | |
| 0.2375152253 | 268 | 0.1% | |
| 0.1352009744 | 234 | 0.1% | |
| Other values (14) | 700 | 0.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 40 | < 0.1% | |
| 0.01218026797 | 3 | < 0.1% | |
| 0.1352009744 | 234 | 0.1% | |
| 0.1571254568 | 13124 | 6.1% | |
| 0.1595615104 | 56 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 10 | < 0.1% | |
| 0.5693057247 | 48 | < 0.1% | |
| 0.5493300853 | 1 | < 0.1% | |
| 0.5386114495 | 78 | < 0.1% | |
| 0.5006090134 | 26 | < 0.1% |
총불입액
Real number (ℝ≥0)
| Distinct count | 935 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1214973287554106 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 137 |
| Zeros (%) | 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.001895783163 |
| Q1 | 0.009069016755 |
| median | 0.04903417533 |
| Q3 | 0.1228160066 |
| 95-th percentile | 0.4917251627 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1137469898 |
Descriptive statistics
| Standard deviation | 0.1659612918 |
|---|---|
| Coefficient of variation (CV) | 1.365966589 |
| Kurtosis | 1.691116703 |
| Mean | 0.1214973288 |
| Median Absolute Deviation (MAD) | 0.1245652135 |
| Skewness | 1.649109372 |
| Sum | 26133.34643 |
| Variance | 0.02754315037 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 2.56186914e-05 1.02474766e-04 9.99128964e-04 1.71645232e-03 ... 6.77409438e-01 7.33975509e-01 7.68407030e-01 8.17594917e-01 1.00000000e+00], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.001895783163 | 13853 | 6.4% | |
| 0.4917251627 | 13538 | 6.3% | |
| 0.005994773787 | 8544 | 4.0% | |
| 0.002920530819 | 7931 | 3.7% | |
| 0.4056463596 | 6289 | 2.9% | |
| 0.003945278475 | 5958 | 2.8% | |
| 0.01214325972 | 5126 | 2.4% | |
| 0.01829174566 | 5048 | 2.3% | |
| 0.003535379413 | 4548 | 2.1% | |
| 0.03673720346 | 3714 | 1.7% | |
| Other values (925) | 140545 | 65.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 137 | 0.1% | |
| 5.123738279e-05 | 49 | < 0.1% | |
| 0.0001537121484 | 6 | < 0.1% | |
| 0.000256186914 | 7 | < 0.1% | |
| 0.0003074242968 | 6 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 1 | < 0.1% | |
| 0.993441615 | 1 | < 0.1% | |
| 0.9836040375 | 3 | < 0.1% | |
| 0.9508121125 | 1 | < 0.1% | |
| 0.9221191782 | 13 | < 0.1% |
해약금액
Real number (ℝ≥0)
| Distinct count | 1824 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02542004964891966 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 176137 |
| Zeros (%) | 81.9% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.2214525284 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.09126681381 |
|---|---|
| Coefficient of variation (CV) | 3.590347583 |
| Kurtosis | 16.95820195 |
| Mean | 0.02542004965 |
| Median Absolute Deviation (MAD) | 0.04496226425 |
| Skewness | 4.112052435 |
| Sum | 5467.700159 |
| Variance | 0.008329631302 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 8.12693498e-05 1.78018576e-04 2.06398349e-04 2.92827657e-04 ... 6.78205624e-01 6.83877709e-01 6.83880934e-01 6.97755418e-01 1.00000000e+00], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 176137 | 81.9% | |
| 0.00257997936 | 7801 | 3.6% | |
| 0.00386996904 | 2470 | 1.1% | |
| 0.5015479876 | 1955 | 0.9% | |
| 0.004643962848 | 1538 | 0.7% | |
| 0.00515995872 | 1431 | 0.7% | |
| 0.4135706914 | 1119 | 0.5% | |
| 0.006191950464 | 1099 | 0.5% | |
| 0.007223942208 | 481 | 0.2% | |
| 0.00773993808 | 372 | 0.2% | |
| Other values (1814) | 20691 | 9.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 176137 | 81.9% | |
| 0.0001625386997 | 17 | < 0.1% | |
| 0.000193498452 | 120 | 0.1% | |
| 0.0002192982456 | 16 | < 0.1% | |
| 0.000257997936 | 33 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 2 | < 0.1% | |
| 0.9453044376 | 1 | < 0.1% | |
| 0.9287925697 | 1 | < 0.1% | |
| 0.816127451 | 1 | < 0.1% | |
| 0.8125 | 1 | < 0.1% |
담당자
Categorical
| Distinct count | 2157 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 더피플라이프 | 72013 | 33.5% | |
| 금강종합상조(주) | 28707 | 13.3% | |
| 강대석 | 6336 | 2.9% | |
| 김영권 | 4857 | 2.3% | |
| 이덕술 | 4749 | 2.2% | |
| 김영경 | 4241 | 2.0% | |
| 제이앤지 | 1848 | 0.9% | |
| 심상열 | 1818 | 0.8% | |
| 고달진 | 1703 | 0.8% | |
| 안미나 | 1489 | 0.7% | |
| Other values (2147) | 87333 | 40.6% |
Length
| Max length | 11 |
|---|---|
| Mean length | 4.831027365 |
| Min length | 2 |
Unicode categories
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Other_Letter | 252 | 96.9% | |
| Uppercase_Letter | 4 | 1.5% | |
| Decimal_Number | 2 | 0.8% | |
| Close_Punctuation | 1 | 0.4% | |
| Open_Punctuation | 1 | 0.4% |
Unicode scripts
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 252 | 96.9% | |
| Latin | 4 | 1.5% | |
| Common | 4 | 1.5% |
Unicode blocks
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Hangul | 252 | 96.9% | |
| ASCII | 8 | 3.1% |
연체횟수
Real number (ℝ≥0)
| Distinct count | 121 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.16695952312322368 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 3066 |
| Zeros (%) | 1.4% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.008333333333 |
| Q1 | 0.008333333333 |
| median | 0.008333333333 |
| Q3 | 0.2916666667 |
| 95-th percentile | 0.775 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.2833333333 |
Descriptive statistics
| Standard deviation | 0.2498663421 |
|---|---|
| Coefficient of variation (CV) | 1.496568374 |
| Kurtosis | 0.8985063815 |
| Mean | 0.1669595231 |
| Median Absolute Deviation (MAD) | 0.2032338943 |
| Skewness | 1.460052037 |
| Sum | 35911.99167 |
| Variance | 0.0624331889 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00416667 0.0125 0.02083333 0.02916667 ... 0.8375 0.92083333 0.97916667 0.99583333 1. ], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.008333333333 | 118228 | 55.0% | |
| 0.01666666667 | 4983 | 2.3% | |
| 0.8333333333 | 3586 | 1.7% | |
| 0 | 3066 | 1.4% | |
| 0.025 | 2372 | 1.1% | |
| 0.05 | 1885 | 0.9% | |
| 0.4833333333 | 1862 | 0.9% | |
| 0.03333333333 | 1758 | 0.8% | |
| 0.04166666667 | 1688 | 0.8% | |
| 0.425 | 1535 | 0.7% | |
| Other values (111) | 74131 | 34.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 3066 | 1.4% | |
| 0.008333333333 | 118228 | 55.0% | |
| 0.01666666667 | 4983 | 2.3% | |
| 0.025 | 2372 | 1.1% | |
| 0.03333333333 | 1758 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 149 | 0.1% | |
| 0.9916666667 | 83 | < 0.1% | |
| 0.9833333333 | 60 | < 0.1% | |
| 0.975 | 52 | < 0.1% | |
| 0.9666666667 | 42 | < 0.1% |
성별
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 113423 | 52.7% | |
| 0 | 101671 | 47.3% |
나이
Real number (ℝ≥0)
| Distinct count | 91 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.34908228518928963 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 112 |
| Zeros (%) | 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.101010101 |
| Q1 | 0.2525252525 |
| median | 0.3535353535 |
| Q3 | 0.4444444444 |
| 95-th percentile | 0.5858585859 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1919191919 |
Descriptive statistics
| Standard deviation | 0.1436689779 |
|---|---|
| Coefficient of variation (CV) | 0.4115619267 |
| Kurtosis | -0.4298260064 |
| Mean | 0.3490822852 |
| Median Absolute Deviation (MAD) | 0.116717409 |
| Skewness | 0.007568506106 |
| Sum | 75085.50505 |
| Variance | 0.0206407752 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.01515152 0.02525253 0.03535354 0.04545455 ... 0.88383838 0.89393939 0.91919192 0.99494949 1. ], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.3939393939 | 6518 | 3.0% | |
| 0.3838383838 | 6261 | 2.9% | |
| 0.3737373737 | 6111 | 2.8% | |
| 0.3131313131 | 6033 | 2.8% | |
| 0.3434343434 | 5876 | 2.7% | |
| 0.3535353535 | 5852 | 2.7% | |
| 0.404040404 | 5840 | 2.7% | |
| 0.3232323232 | 5569 | 2.6% | |
| 0.4141414141 | 5525 | 2.6% | |
| 0.3333333333 | 5511 | 2.6% | |
| Other values (81) | 155998 | 72.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 112 | 0.1% | |
| 0.0101010101 | 182 | 0.1% | |
| 0.0202020202 | 313 | 0.1% | |
| 0.0303030303 | 483 | 0.2% | |
| 0.0404040404 | 671 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 24 | < 0.1% | |
| 0.9898989899 | 3 | < 0.1% | |
| 0.9797979798 | 1 | < 0.1% | |
| 0.9696969697 | 1 | < 0.1% | |
| 0.9393939394 | 2 | < 0.1% |
진행률
Real number (ℝ≥0)
| Distinct count | 605 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22027684045492693 |
|---|---|
| Minimum | 0.002564102564102564 |
| Maximum | 1.2512820512820513 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0.002564102564 |
|---|---|
| 5-th percentile | 0.002777777778 |
| Q1 | 0.012 |
| median | 0.06923076923 |
| Q3 | 0.1722222222 |
| 95-th percentile | 1 |
| Maximum | 1.251282051 |
| Range | 1.248717949 |
| Interquartile range (IQR) | 0.1602222222 |
Descriptive statistics
| Standard deviation | 0.3297116756 |
|---|---|
| Coefficient of variation (CV) | 1.496805905 |
| Kurtosis | 1.024747933 |
| Mean | 0.2202768405 |
| Median Absolute Deviation (MAD) | 0.2521903989 |
| Skewness | 1.612337493 |
| Sum | 47380.22672 |
| Variance | 0.108709789 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0025641 0.00267094 0.00305556 0.00356061 0.00381702 ... 0.98666667 0.99083333 0.99583333 1.00357143 1.25128205], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 23750 | 11.0% | |
| 0.002564102564 | 8753 | 4.1% | |
| 0.003846153846 | 7785 | 3.6% | |
| 0.007692307692 | 7232 | 3.4% | |
| 0.002777777778 | 5097 | 2.4% | |
| 0.004 | 4548 | 2.1% | |
| 0.01 | 4170 | 1.9% | |
| 0.01538461538 | 4131 | 1.9% | |
| 0.008 | 3407 | 1.6% | |
| 0.02307692308 | 3387 | 1.6% | |
| Other values (595) | 142834 | 66.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.002564102564 | 8753 | 4.1% | |
| 0.002777777778 | 5097 | 2.4% | |
| 0.003333333333 | 8 | < 0.1% | |
| 0.003787878788 | 49 | < 0.1% | |
| 0.003846153846 | 7785 | 3.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1.251282051 | 1 | < 0.1% | |
| 1.01 | 7 | < 0.1% | |
| 1.007142857 | 1 | < 0.1% | |
| 1 | 23750 | 11.0% | |
| 0.9916666667 | 18 | < 0.1% |
상태
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1685216695956187 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 3665 |
| Zeros (%) | 1.7% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.063133396 |
|---|---|
| Coefficient of variation (CV) | 0.490257216 |
| Kurtosis | -1.500647419 |
| Mean | 2.16852167 |
| Median Absolute Deviation (MAD) | 1.008188777 |
| Skewness | -0.1720879326 |
| Sum | 466436 |
| Variance | 1.130252619 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4. ], "bayesian blocks" binning strategy used)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 3 | 106467 | 49.5% | |
| 1 | 84623 | 39.3% | |
| 4 | 10867 | 5.1% | |
| 2 | 9472 | 4.4% | |
| 0 | 3665 | 1.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 3665 | 1.7% | |
| 1 | 84623 | 39.3% | |
| 2 | 9472 | 4.4% | |
| 3 | 106467 | 49.5% | |
| 4 | 10867 | 5.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4 | 10867 | 5.1% | |
| 3 | 106467 | 49.5% | |
| 2 | 9472 | 4.4% | |
| 1 | 84623 | 39.3% | |
| 0 | 3665 | 1.7% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
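As an illustration of that calculation, here is a minimal pure-Python sketch. The function name `pearson_r` and the toy data are assumptions made for this example; they are not taken from the report or from pandas-profiling's internals.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors of the covariance and the variances cancel, so they are omitted.
    return cov / sqrt(var_x * var_y)

# Toy data, not taken from the profiled dataset.
x = [1.0, 2.0, 3.0, 4.0]
r_increasing = pearson_r(x, [2 * v + 1 for v in x])     # exact increasing line
r_decreasing = pearson_r(x, [-0.5 * v + 7 for v in x])  # exact decreasing line
print(r_increasing, r_decreasing)
```

The two lines have different slopes and offsets, yet both give |r| = 1, which is the location and scale invariance described above.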
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
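The rank-based calculation can be sketched in pure Python as follows. The helper names (`average_ranks`, `spearman_rho`) and the toy data are assumptions for this example. Tied values receive the average of their ranks, which is a common convention but is not stated in the report.

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the mean of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))

def spearman_rho(x, y):
    """Pearson correlation of the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Toy data: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this toy data ρ reaches 1 while Pearson's r stays below 1, because the relationship is perfectly monotonic but not linear.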
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
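The pair-counting definition can be sketched directly in Python. This implements the formula exactly as worded above (often called tau-a); the function name `kendall_tau_a` and the toy data are assumptions for this example. Tied pairs count as neither concordant nor discordant here, whereas statistical libraries commonly report a tie-adjusted variant (tau-b), so results may differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
    return (concordant - discordant) / len(pairs)

# Toy data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.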
First rows
| | df_index | 회원번호 | 회원이름 | 주소 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 연체횟수 | 성별 | 나이 | 진행률 | 상태 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0022A00001 | 이옥성590318 | 경기 | 0.208283 | 0.491725 | 0.501548 | 더피플라이프 | 0.008333 | 0 | 0.404040 | 1.00 | 0 |
| 1 | 1 | 0072A00001 | 안성열581125 | 경기 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 0 | 0.414141 | 1.00 | 2 |
| 2 | 2 | 0072A00002 | 배준택831121 | 부산 | 0.208283 | 0.334324 | 0.000000 | 더피플라이프 | 0.275000 | 0 | 0.161616 | 0.68 | 3 |
| 3 | 3 | 0072A00003 | 배민규821023 | 울산 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 0 | 0.171717 | 1.00 | 2 |
| 4 | 4 | 0072A00006 | 최금순340728 | 경기 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 1 | 0.656566 | 1.00 | 2 |
| 5 | 5 | 0072A00007 | 주병오520206 | 서울 | 0.208283 | 0.344162 | 0.312693 | 더피플라이프 | 0.258333 | 0 | 0.474747 | 0.70 | 1 |
| 6 | 6 | 0072A00021 | 정성제760210 | 서울 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 0 | 0.232323 | 1.00 | 4 |
| 7 | 7 | 0072A00022 | 신영주541016 | 충청 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 1 | 0.454545 | 1.00 | 4 |
| 8 | 8 | 0072A00026 | 윤일선521001 | 서울 | 0.208283 | 0.472050 | 0.481424 | 더피플라이프 | 0.041667 | 1 | 0.474747 | 0.96 | 1 |
| 9 | 9 | 0072A00027 | 김건용530815 | 서울 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 1 | 0.464646 | 1.00 | 4 |
Last rows
| | df_index | 회원번호 | 회원이름 | 주소 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 연체횟수 | 성별 | 나이 | 진행률 | 상태 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 215084 | 234600 | U022A21089 | 백쌍순320814 | 부산 | 0.208283 | 0.285136 | 0.235294 | 더피플라이프 | 0.358333 | 1 | 0.676768 | 0.580000 | 1 |
| 215085 | 234601 | U022A21090 | 손희락520228 | 부산 | 0.208283 | 0.408106 | 0.416151 | 더피플라이프 | 0.150000 | 0 | 0.474747 | 0.830000 | 1 |
| 215086 | 234602 | U022A21305 | 길준분660312 | 부산 | 0.208283 | 0.265461 | 0.213622 | 더피플라이프 | 0.391667 | 1 | 0.333333 | 0.540000 | 1 |
| 215087 | 234605 | U022A21379 | 김관수370127 | 경남 | 0.208283 | 0.098222 | 0.000000 | 더피플라이프 | 0.675000 | 0 | 0.626263 | 0.200000 | 1 |
| 215088 | 234606 | U022A22154 | 김순옥561026 | 부산 | 0.208283 | 0.014603 | 0.000000 | 더피플라이프 | 0.816667 | 1 | 0.434343 | 0.030000 | 1 |
| 215089 | 234607 | U022A22155 | 조길찬761012 | 부산 | 0.208283 | 0.014603 | 0.000000 | 더피플라이프 | 0.816667 | 0 | 0.232323 | 0.030000 | 1 |
| 215090 | 234609 | U244A00803 | 배상호820807 | 부산 | 0.208283 | 0.352359 | 0.319917 | 더피플라이프 | 0.150000 | 0 | 0.171717 | 0.716667 | 1 |
| 215091 | 234610 | U244A00804 | 배규태830402 | 부산 | 0.208283 | 0.491725 | 0.000000 | 더피플라이프 | 0.008333 | 0 | 0.161616 | 1.000000 | 4 |
| 215092 | 234611 | U244A00805 | 김군자490809 | 부산 | 0.208283 | 0.008044 | 0.000000 | 더피플라이프 | 0.500000 | 1 | 0.505051 | 0.016667 | 1 |
| 215093 | 234612 | U244A00806 | 정일선540801 | 부산 | 0.208283 | 0.098222 | 0.000000 | 더피플라이프 | 0.408333 | 1 | 0.454545 | 0.200000 | 1 |